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Abstract 

Background: One of the challenges in insect chemical ecology is to understand how insect pheromones are 
synthesised, detected and degraded. Genome wide survey by comparative sequencing and gene specific 
expression profiling provide rich resources for this challenge. A ipsilon is a destructive pest of many crops and 
further characterization of the genes involved in pheromone biosynthesis and transport could offer potential 
targets for disruption of their chemical communication and for crop protection. 

Results: Here we report 454 next-generation sequencing of the A ipsilon pheromone gland transcriptome, 
identification and expression profiling of genes putatively involved in pheromone production, transport and 
degradation. A total of 23473 unigenes were obtained from the transcriptome analysis, 86% of which were 
A ipsilon specific. 42 transcripts encoded enzymes putatively involved in pheromone biosynthesis, of which 15 were 
specifically, or mainly, expressed in the pheromone glands at 5 to 120-fold higher levels than in the body. Two 
transcripts encoding for a fatty acid synthase and a desaturase were highly abundant in the transcriptome and 
expressed more than 40-fold higher in the glands than in the body. The transcripts encoding for 2 acetyl-CoA 
carboxylases, 1 fatty acid synthase, 2 desaturases, 3 acyl-CoA reductases, 2 alcohol oxidases, 2 aldehyde reductases 
and 3 acetyltransferases were expressed at a significantly higher level in the pheromone glands than in the body. 
17 esterase transcripts were not gland-specific and 7 of these were expressed highly in the antennae. Seven 
transcripts encoding odorant binding proteins (OBPs) and 8 encoding chemosensory proteins (CSPs) were 
identified. Two CSP transcripts {AipsCSP2, AipsCSP8) were highly abundant in the pheromone gland transcriptome 
and this was confirmed by qRT-PCR. One OBP (AipsOBP6) were pheromone gland-enriched and three OBPs 
(AipsOBPl, AipsOBP2 and AipsOBP4) were antennal-enriched. Based on these studies we proposed possible A ipsilon 
biosynthesis pathways for major and minor sex pheromone components. 

Conclusions: Our study identified genes potentially involved in sex pheromone biosynthesis and transport in 
A. ipsilon. The identified genes are likely to play essential roles in sex pheromone production, transport and 
degradation and could serve as targets to interfere with pheromone release. The identification of highly expressed 
GSPs and OBPs in the pheromone gland suggests that they may play a role in the binding, transport and release of 
sex pheromones during sex pheromone production in A ipsilon and other Lepidoptera insects. 
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Background 

Lepidoptera sex pheromones are primarily C10-C18 long 
straight chain unsaturated alcohols, aldehydes or acetate 
esters [1], biosynthesised and released mainly from 
pheromone glands located between the 8 th and 9 th ab- 
dominal segments of the female moths. Usually the fe- 
males use a mixture of compounds in a unique ratio to 
attract conspecific males [2]. The extremely high specifi- 
city and sensitivity of species-specific pheromones make 
them potential biological control agents for population 
monitoring, mass trapping and reducing pesticide use in 
integrated pest management (IPM) programs [3-5]. Fur- 
ther use of pheromones in such strategies would be 
aided by an understanding of the pathways involved in 
pheromone biosynthesis and transport. 

Most sex pheromone blends of Lepidoptera insects are 
synthesised de novo via modified fatty acid biosynthesis 
pathways [2,6,7] and gland-specific enzymes are involved 
in desaturation, chain shortening, reduction and acetyl- 
ation [1,2]. Different species use different combinations 
of these reactions to produce unique species-specific 
pheromone blends. The first step is the synthesis of sat- 
urated fatty acid precursors malonyl-CoA from acetyl- 
CoA by acetyl- Co A carboxylase (ACC) and fatty acid 
synthetase (FAS) [8,9]. Labeling studies conducted with 
acetate indicated that malonyl-CoA and NADPH are 
used by FAS to produce mainly saturated stearic acid 
(18:0) and palmitic acid (16:0) with 18 and 16 carbon 
atoms and no double bonds, respectively, as precursors 
[10-12]. Modification of the fatty acid chain includes the 
introduction of a double bond by desaturases specific to 
pheromone biosynthesis followed by chain shortening 
using specific p-oxidation enzymes [13,14]. So far, sev- 
eral types of desaturases have been extensively studied 
through gene characterization and expression analysis, 
including A5 [15], A9 [16,17], A10 [18], All [19,20], and 
A14 [21] desaturases. Once unsaturated pheromone pre- 
cursor with a specific chain-length is produced, the carb- 
oxyl carbon is modified to form one of functional 
groups (aldehyde, alcohol or acetate ester). These modifi- 
cations require the enzymes fatty acid reductase to pro- 
duce the alcohols from the fatty acyl precursor [22], which 
in some species may be oxidized to aldehydes serving as 
pheromone components [23], and to acetate esters (OAc) 
by acetyltransferase [24]. Recently, a few members of the 
reductase gene family have been discovered and function- 
ally characterized in several Lepidoptera species, including 
Ostrinia scapulalis [25], Heliothis virescens, Heliothis 
subflexa, Helicoverpa armigera, Helicoverpa assulta 
[26], Ostrinia nubilalis [27], Yponomeuta evonymellus 
(L.), Yponomeuta padellus (L.) and Yponomeuta rorellus 
(Hubner) [28]. A number of pheromone gland-specific 
enzymes have been identified and their essential func- 
tions in pheromone production demonstrated in vitro 



as well as in vivo. For example, using RNA interference, 
Matsumoto and colleagues showed that two pheromone 
gland-specific enzymes (acyl-CoA desaturase and a 
fatty-acyl reductase) are responsible for pheromone 
production in the silk moth Bombyx mori [29-31]. 

After production and release of the sex pheromone com- 
ponents by female moths the males detect the pheromone 
and respond for mating. It is commonly accepted that 
pheromone molecules are captured and transported to the 
pheromone receptors on the dendrites of pheromone- 
sensitive neurons by olfactory binding proteins, including 
odorant binding proteins (OBPs) and chemosensory pro- 
teins (CSPs) [32-34]. Pheromone binding proteins (PBPs) 
bind to sex pheromone components and classified into a 
subclass of OBPs [35]. After activation of the pheromone 
receptors the olfactory signals must be degraded rapidly to 
prevent from prolonged neuronal excitation [36]. This may 
involve pheromone degrading enzymes (PDEs) capable of 
degrading the pheromone molecules [37]. 

The black cutworm Agrotis ipsilon is a destructive pol- 
yphagous insect pest of many crops and for a strain from 
China the female sex pheromone blend comprises five main 
acetate components: (Z)-ll-hexadecenyl acetate (Zll-16: 
OAc), (Z)-9-tetradecenyl acetate (Z9-14:OAc), (Z)-7- 
dodecenyl acetate (Z7-12:OAc), (Z)-8-dodecenyl acetate 
(Z8-12:OAc) and (Z)-5-decenyl acetate (Z5-10:OAc) [38]. 
These components indicate the involvement of different 
desaturases and E-oxidases during the sex pheromone bio- 
synthesis. However, the genes/proteins and their specific 
function in mediating A. ipsilon pheromone production, 
transport and degradation have not been characterized. 
Over the last few years, the next generation sequencing 
such as 454 pyrosequencing technique provides an easy 
and effective method for the discovery of novel genes. In 
present study, using the Roche GS FLX Titanium sequen- 
cing platform, we report a genetic database of the genes 
expressed in the pheromone glands of A. ipsilon and the 
identification of genes with putative roles in pheromone 
biosynthesis, degradation and transport as well as their tis- 
sue expression profiles. 

Results and discussion 

454 sequencing and unigene assembly 

Sequencing of a cDNA library prepared from mRNAs of 
the pheromone glands of A. ipsilon gave a total of 631,425 
raw reads with an average length of 517 base pairs (bp). 
After trimming adaptor sequences and removing low 
quality sequences, 629,273 clean reads remained with an 
average length of 496 bp. The size distribution of the clean 
reads is shown in Additional file 1. The sequences of all 
reads have been deposited in the NCBI SRA database with 
the accession number SRX189143. 

The 629,273 clean reads were assembled into 23,473 
unigenes, including 20,541 contigs (87.5%) and 2,932 
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singletons (12.5%), the largest transcriptome dataset so far 
from moth sex pheromone glands. An overview of the se- 
quencing and assembly results is presented in Table 1. 
The length of the assembled unigenes ranged from 100 bp 
to 21842 bp with an average length of 770 bp. Among the 
unigenes, 22,035 (93.9%) are between 200 bp and 2000 bp 
long with an average length of 649 bp. These unigenes are 
in fact transcripts in the A. ipsilon pheromone gland 
cDNA library. Therefore we refer them as transcripts. All 
sequences of the unigenes used in the current study are 
provided in Additional file 2. 

Analysis of the transcripts from the A. ipsilon pheromone 
gland 

BLASTx and BLASTn were used to compare each 

A. ipsilon transcript with a cut-off E-value of 1.0E-5 
against GeneBank entries. 12,989 transcripts (55%) had 
BLASTx hits in the non-redundant protein (nr) databases 
and 9,392 (40%) had BLASTn hits in the non-redundant 
nucleotide sequence (nt) databases. This is consistent with 
a previous report of H. virescens pheromone gland ESTs 
[39]. Some of the A. ipsilon transcripts were homologous 
to those from more than one species but in general most 
were homologous to other Lepidoptera species taking up 
2,379 in the 9,392 BLASTn hits, including 1,124 (12%) to 

B. mori entries. The second highest hits were to Dipteran 
species with 343 hits to D. melanogaster and 279 and 221 
hits to the mosquitoes Anopheles gambiae and Aedes 
aegypti, respectively. The lowest hits were to the wasp 
Nasonia vitripennis (190 hits), the beetle Tribolium 
castaneum (147 hits) and the pea aphid Acyrthosiphon 
pisum (136 hits). The top 15 insect species that have sig- 
nificant BLASTn hits are shown in Figure 1. 

Gene Ontology of the genes expressed in the A. ipsilon 
pheromone gland 

The 23,473 assembled transcripts were annotated into dif- 
ferent functional groups according to Gene Ontology (GO) 
analysis. Some transcripts were annotated into more than 
one GO category. Of the 22,473 transcripts, 7,546 (32%) 
could be assigned to a GO category (Additional file 3). The 
"cellular process" and "metabolic process" GO categories 

Table 1 Summary of A ipsilon pheromone gland unigene 
sequences and assembly 



Sequence Average length Length distribution 
number (bp) (bp) 



Raw 
reads 


631425 


517 


40-1200 bp 


Clean 
reads 


629273 


496 


40-1200 bp 


singletons 


2932 


421 


100-835 bp 


contigs 


20541 


820 


100-21842 bp 


unigenes 


23473 


770 


100-21842 bp 



were most abundantly represented with 4,056 (17.3%) and 
3,361 (14.3%) transcripts, respectively, within the bio- 
logical process GO ontology. In the "cellular components" 
GO ontology the transcripts were mainly distributed in 
cell (18.8%) (4,415 transcripts) and cell part (17.6%) (4,133 
transcripts). The GO analysis also showed that in the mo- 
lecular function ontology 3,271 transcripts (13.9%) were 
annotated as having binding functions and 3,484 (14.8%) 
to have catalytic activity. 

Comparative analysis of transcripts in Lepidoptera 
pheromone glands 

In order to compare the A. ipsilon pheromone gland tran- 
scriptome with those from other Lepidoptera and to iden- 
tify A. ipsilon transcripts with potential involvement in sex 
pheromone production and transport we downloaded 
the pheromone gland ESTs of three other Lepidoptera 
A. segetum, B. mori and H. virescens from the dbEST data- 
base of NCBI and previously published pheromone gland 
transcriptome of H. virescens [39]. After assembling these 
ESTs we obtained 925 unigenes from A. segetum, 3943 
from B. mori and 8202 from H. virescens with an average 
length of 384 bp, 692 bp and 474 bp, respectively. These 
are much lower numbers than that obtained by the current 
study through the 454 sequencing of the A. ipisilon phero- 
mone gland, demonstrating that our pheromone gland 
transcriptome is currently the largest transcriptome re- 
source for an insect pheromone gland. 

When comparing the pheromone gland transcripts 
pairwise using best bidirectional hits, we found that there 
were 461 homologous transcripts between A. ipsilon 
and A. segetum, 1110 homologous transcripts between 
A. ipsilon and B. mori, and 2106 homologous transcripts 
between A. ipsilon and H. virescens (Figure 2). A large 
portion of A. ipsilon transcripts (86.4%) (20,274 out of 
23,473) had no homologous ESTs in the available phero- 
mone gland EST libraries of the other 3 species. This may 
be due to the larger dataset (23,473 unigenes) for A. ipsilon 
and lower coverage in the other studies. Nevertheless, it 
was shown that 309 transcripts, 5,755 transcripts and 2,556 
transcripts are only found in A. segetum, H. virescen and B. 
mori, respectively, in our comparison (Figure 2). 

Transcript abundance in the A. ipsilon pheromone gland 

The pheromone gland mRNA samples used for 
constructing the cDNA library were non-normalized and 
non-amplified by PCR, so the reads in the sequencing 
dataset most likely represent the relative abundance of 
each assembled transcript in the pheromone gland as 
summarized in Table 2. The most abundant transcripts 
include vitellogenin, a major reproductive protein in in- 
sects (2,925 reads per kilobase per million mapped reads 
(RPKM); 2.2% reads), the precursor of egg yolk proteins 
for insect egg production [40] and genes involved in PBAN 
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Figure 1 Top 15 insect species that have significant BLASTn hits. All A ipsilon pheromone gland unigenes were used in BLASTn searches 
against the GenBank entries. The significant hits with an E-value <=1.0E-5 for each query were grouped according to species and the number of 
the unigenes that had significant homology is indicated after the specie name. 



A. segetum B. mori 




Figure 2 Comparative analysis of A. ipsilon pheromone gland 
unigenes with other insects. This shows the overlap of blast 
homology in genes expressed in pheremone glands in four 
species of Lepidoptera. The comparative analyses of A ipsilon, H. 
virescens, B. mori and A segetum pheromone gland unigenes were 
performed based on the Best Bidirectional Hits results (reciprocal 
BLASTn, E-value less than 1.0E-6). 



stimulated pheromone production such as lipase 3 [41] 
(4,731 RPKM; 0.8% reads) and in sex pheromone bio- 
synthesis such as acyl-CoA desaturase (1,206 RPKM; 0.3% 
reads) and in lipid transport such as apolipophorin III 
(2894 RPKM 0.4% reads). Another highly abundant tran- 
script {Unigene_721) with 1,365 RPKM encodes a CSP with 
a 76% protein identity to the H. virescens CSP (Protein ID: 
ACX53806) and 41% to the ejaculatory bulb-specific pro- 
tein 3 of D. melanogaster (Protein ID: Q9W1C9). 

Candidate genes in the A ipsilon pheromone gland with 
putative functions in pheromone production, transport 
and degradation 

The overall enzymatic steps during pheromone biosynthesis 
in A. ipsilon are likely to be similar to those in other moth 
species, which include fatty acid synthesis, desaturation, 
chain shortening, reduction and acetylation [1,2,6]. By 
homologous searches we identified members of gene 
subfamilies in the A. ipsilon pheromone gland transcrip- 
tome putatively involved in these biosynthetic processes 
and pheromone production, including transcripts putatively 
encoding 3 synthases (2 actyl-CoA carboxylase and 1 fatty 
acid synthase), 5 desaturases, 13 acyl-CoA reductases, 5 
alcohol oxidases and 5 acetyltransferases as well as 11 alde- 
hyde reductases (Table 3); 17 transcripts encoding putative 
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Table 2 The most prevalent mRNAs in A. ipsilon sex pheromone gland 



Unigene ID 


GenBank annotation 


Species 


Unigene_ 


_694 


vitellogenin 


Spodoptera litura 


Unigene. 


_3480 


lipase 3 


Danaus plexippus 


Unigene_ 


.17140 


C-type lectin 5 


Helicoverpa armigera 


Unigene_ 


_2237 


vitellogenin 


Spodoptera lituro 


Unigene. 


_688 


translation elongation factor 2 


Spodoptera exigua 


Unigene_ 


.18106 


apolipophorin precursor 


Bombyx mori 


Unigene_ 


.692 


apolipophorin precursor 


Manduca sexta 


Unigene_ 


.17571 


calcium ATPase 


Heliothis virescens 


Unigene. 


.2742 


A DP/ATP translocase 


Helico verpaarm igera 


Unigene_ 


.17691 


heat shock e protein 70 


Agrotis ipsilon 


Unigene_ 


.728 


cathepsin 


Helicoverpa armigera 


Unigene. 


.7978 


elongation factor 1 alpha 


Papili opolytes 


Unigene_ 


.178 


apolipophorin III 


Trichoplusia ni 


Unigene_ 


.780 


acyl-CoA desaturase 


Helicoverpa assulta 


Unigene_ 


.3446 


myosin regulatory light chain 2 


Bombyx mori 


Unigene_ 


.18120 


fatty acid synthase 


Tribolium castaneum 


Unigene_ 


.721 


Chemosensory protein 


Heliothis virescens 



pheromone degradation enzymes (Table 4); 8 transcripts 
encoding putative CSPs and 7 transcripts encoding putative 
OBPs (Table 5). Their abundances in the pheromone gland 
transcriptome are shown in Figures 3 and 4. We further 
validated and characterized the expression level and the tis- 
sue distribution of these genes by RT-PCR and qRT-PCR 
and summarised below. There is a clear agreement between 
the transcript abundance estimated by the transcriptome 
sequencing and transcript expression level in the phero- 
mone gland as measured by RT-PCR and qRT-PCR. 

Receptor for the pheromone biosynthesis activating 
neuropeptide (PBAN) 

PBAN is released from the suboesophagal ganglion in the 
brain and goes to the hemolymph, where it binds to the 
PBAN receptor in the membrane of the pheromone gland 
and triggers the pheromone production [42,43]. Although 
there was no PBAN receptor found in the pheromone 
gland transcriptome of H. virescens [39] we found one tran- 
script (Unigene_3821) encoding a protein highly homolo- 
gous to PBAN receptor isoform B. It has very low 
abundance in the A. ipsilon transcriptome (31 RPKM) but 
high amino acid identity of 97% to H. virescens PBAN re- 
ceptor in GenBank (Protein IDs: ABU93813) [44]. 

Acetyl-CoA carboxylase (ACQ 

Saturated long chain fatty acids are the precursors of sex 
pheromones in most moth species. Their biosynthesis is 
started by ACC catalysing the production of malonyl-CoA 
from acetyl-CoA in the first committed biosynthesis step 
[8,9]. In the A. ipsilon pheromone gland we found two 



Accession number Score E-value % Identity No. of Reads 



ABU68426 


2195 


0.0 


71% 


10106 


EHJ71865 


344 


5E-108 


47% 


5201 


AFI47450 


411 


4E-141 


62% 


4131 


ABU68426 


673 


0.0 


66% 


3500 


AAL83698 


1565 


0.0 


96% 


3122 


BAK82317 


4076 


0.0 


61% 


2457 


AAB53254 


2536 


0.0 


63% 


2274 


AAD09820 


1868 


0.0 


97% 


2048 




jy 1 


9F 1 1f* 

ZL 1 JO 


yZ /0 


ZsJoy 


AEG78288 


1131 


0.0 


100% 


1832 


NP_075125 


263 


9E-82 


43% 


1783 


BAM18878 


895 


0.0 


99% 


1769 


ABV68867 


288 


2E-96 


88% 


1709 


AF482909 


692 


0.0 


94% 


1691 


NP_001091813 


298 


1E-99 


98% 


1627 


XP_97041 7 


2436 


0.0 


57% 


1486 


ACX53806 


206 


3E-66 


76% 


390 



transcripts (ACC-JX989149 and ACC-JX989150) encoding 
ACCs. ACC-JX989149 with an open reading frame (ORF) 
of 5841 bp encodes for a ACC with 67% amino acid identity 
with the ACC of T. castaneum (Protein ID: XP_969851) 
and ACC-JX989150 encodes a protein with 56% amino acid 
identity with the ACC of H. virescens (Protein ID: 
ACX53705) (Table 3). The RT-PCR and qRT-PCR revealed 
that both ACC-JX989149 and ACC-JX989150 are highly 
expressed in the pheromone gland as compared to the body 
(Figure 5 and Figure 6). However, they have very low abun- 
dance (81 and 21 RPKM) in the transcriptome (Figure 3). 

Fatty acid synthase (FAS) 

FAS has been shown to catalyse the conversion of 
malonyl-CoA and NADPH to produce saturated fatty 
acids [8]. We identified one putative FAS transcript 
(FAS-JX989151) in the A. ipsilon pheromone gland 
(Table 3), containing an ORF of 7176 bp and encoding 
a FAS with 57% amino acid identity to the FAS of 
T. castaneum (Protein ID: XP_970417). The RT-PCR 
and qRT-PCR revealed that FAS-JX9891S1 is highly 
expressed in the pheromone gland (40-fold higher than 
in the body, Figure 5 and Figure 6) and also has a high 
abundance (343 RPKM) in the transcriptome (Figure 3). 

Desaturase (DES) 

Pheromone-specific desaturases introduce double bond(s) 
into the fatty acids at specific positions along the chain. Five 
putative sex pheromone components extracted from A. 
ipsilon sex pheromone gland are unsaturated fatty acids 
with acetate as the functional group and 16 or less carbons 



Table 3 Putative pheromone biosynthesis related genes in the A. ipsilon pheromone gland 



Unigene* 


Accession Num. 


Length (bp) 


No. of Reads 


Putative identification 


Species 


Accession Num. § 


Score (bits) 


E-value 


% Identity 


Acetyl CoA Carboxylase 




















Unigene_2338-ACC 


JX989149 


7534 


305 


acetyl-CoA carboxylase 


Tribolium 
casta neum 


XP_969851 


2644 


0.0 


67% 


Unigene_6244-ACC 


JX989150 


621 


17 


acetyl-CoA carboxylase 


Heliothis virescens 


ACX53705 


169 


2E-47 


56% 


Fatty acid synthase 




















Unigene_18120-FAS 


JX989151 


8703 


2341 


fatty acid synthase 


Tribolium casta neum 


XP_970417 


2436 


0.0 


57% 


Desaturases 




















Unigene_65-DES 


JX989152 


1077 


154 


acyl-CoA delta 9 desaturase 


Mamestra brassicae 


ABX90048 


681 


0.0 


96% 


Unigene_741-DES 


JX989153 


1047 


630 


acyl-CoA delta 1 1 desaturase 


Mamestra brassicae 


ABX90049 


610 


0.0 


87% 


Unigene_780-DES 


JX989154 


1077 


1710 


acyl-CoA desaturase HassNPVE 


Helicoverpa assulta 


AF482909 


692 


0.0 


94% 


Unigene_10494-DES 


JX989155 


254 


3 


desaturase 


Spodoptera littoralis 


AAQ74260 


98.2 


3E-22 


64% 


Unigene_15401-DES 


JX989156 


489 


9 


acyl-CoA desaturaseSexiGATD 


Spodoptera exigua 


AAM28510 


323 


1E-110 


93% 


Fatty acyl reductase 




















Unigene_163-FAR 


JX989157 


1593 


82 


fatty-acyl CoA reductase 6 


Ostrinia nubilalis 


ADI82779 


536 


0.0 


59% 


Unigene_1098-FAR 


JX989158 


948 


43 


putative fatty acyl-CoA reductase 


Nasonia vitripennis 


XP_00 1600309 


261 


2E-119 


51% 


Unigene_2537-FAR 


JX989159 


1560 


105 


putative fatty acyl-CoA reductase 


Bom bus terrestris 


XP_003399879 


585 


0.0 


57% 


Unigene_3905-FAR 


JX989160 


1593 


37 


putative fatty acyl-CoA reductase 


Apis mellifera 


ADI87410 


541 


0.0 


59% 


Unigene_4078-FAR 


JX989161 


1017 


14 


fatty-acyl CoA reductase 3 


Ostrinia nubilalis 


ADI82776 


519 


2E-178 


80% 


Unigene_4302-FAR 


JX989162 


861 


56 


fatty-acyl CoA reductase 6 


Danaus plexippus 


EHJ76493 


372 


8E-174 


61% 


Unigene_6708-FAR 


JX989163 


615 


7 


fatty-acyl CoA reductase 4 


Ostrinia nubilalis 


ADI82777 


342 


7E-113 


77% 


Unigene_7344-FAR 


JX989164 


1380 


43 


fatty-acyl reductase 


Heliothis virescens 


ACX53790 


671 


0.0 


70% 


Unigene_8541-FAR 


JX989165 


375 


4 


fatty-acyl CoA reductase 5 


Ostrinia nubilalis 


ADI82778 


193 


4E-63 


69% 


Unigene_1 1561 -FAR 


JX989166 


182 


2 


fatty-acyl-CoA reductase 


Heliothis virescens 


ACX53773 


104 


2E-26 


82% 


Unigene_12329-FAR 


JX989167 


855 




fatty-acyl CoA reductase 6 


Ostrinia nubilalis 


ADI82779 


315 


3E-101 


66% 


Unigene_12652-FAR 


JX989168 


249 


2 


fatty-acyl-CoA reductase 


Heliothis virescens 


ACX53773 


117 


5E-31 


76% 


Unigene_15351-FAR 


JX989169 


714 


14 


putative fatty acyl-CoA reductase 


Apis mellifera 


ADI87410 


283 


6E-90 


56% 


Unigene 


Accession Num. 


Length (bp) 


No. of Reads 


Putative identification 


Species 


Accession Num. § 


Score (bits) 


f-value 


% Identity 


Alcohol oxidase 




















Unigene_195-AOX 


KC007341 


834 


25 


Putative alcohol dehydrogenase 


Danaus plexippus 


EHJ7061 1 


221 


2E-67 


43% 


Unigene_307-AOX 


KC007342 


969 


56 


Putative alcohol dehydrogenase 


Danaus plexippus 


EHJ7061 1 


327 


3E-106 


55% 


Unigene_397-AOX 


KC007343 


756 


253 


alcohol dehydrogenase 


Heliothis virescens 


ACX53694 


405 


6E-139 


78% 


Unigene_7733-AOX 


KC007344 


576 


6 


Putative alcohol dehydrogenase 


Danaus plexippus 


EHJ7061 1 


219 


7E-68 


64% 


Unigene_10714-AOX 


KC007345 


501 


3 


Putative alcohol dehydrogenase 


Danaus plexippus 


EHJ73729 


230 


8E-72 


68% 



Table 3 Putative pheromone biosynthesis related genes in the A. ipsilon pheromone gland (Continued) 



Aldehyde reductase 



Unigene_12563-AR 


KC007346 


849 


45 


putative aldo-ketosereductase 1 


Pa pi Ho dardanus 


CAW30924 


394 


2E-134 


70% 


Unigene_1274-AR 


KC007347 


945 


34 


a I do-keto reductase 


Bombyx mori 


ADQ89807 


491 


2E-168 


78% 


Unigene_17351-AR 


KC007348 


930 


43 


aldo-ketoreductase 2E 


Bombyx mori 


BAL70378 


342 


1E-113 


58% 


Unigene_1774-AR 


KC007349 


954 


37 


aldo-ketoreductase 


Helicoverpa armigera 


AEB26313 


577 


0.0 


88% 


Unigene_3134-AR 


KC007350 


975 


35 


putative aldo-ketoreductase 


Donous plexippus 


EHJ71186 


410 


9E-140 


66% 


Unigene_4806-AR 


KC007351 


726 


9 


aldehyde reductase 1 


Culex quinquefasciatus 


XP_00 1844836 


302 


1E-98 


64% 


Unigene_5103-AR 


KC007352 


598 


8 


aldo-ketoreductase 


Heliothis virescens 


ACX53715 


296 


2E-96 


72% 


Unigene_7337-AR 


KC007353 


456 


5 


aldo-ketoreductase 


Heliothis virescens 


ACX53798 


236 


2E-74 


80% 


Unigene_7554-AR 


KC007354 


552 


92 


aldo-ketoreductase, partial 


Popilio xuthus 


BAM20078 


183 


4E-54 


57% 


Unigene_9245-AR 


KC007355 


645 


8 


aldo-ketoreductase 


Papilio xuthus 


BAM 19656 


233 


8E-73 


53% 


Unigene_9786-AR 


KC007356 


963 


28 


aldo-ketoreductase 


Donous plexippus 


EHJ68075 


407 


3E-135 


63% 


cety transferase 




















Unigene_173-ATF 


KC007357 


795 


43 


acetyltransferase 1 


Donous plexippus 


EHJ65205 


317 


1E-145 


71% 


Unigene_407-ATF 


KC007358 


855 


196 


acyltransferase 


Heliothis virescens 


ACX53812 


494 


5E-172 


90% 


Unigene_553-ATF 


KC007359 


1038 


141 


putative acetyl transferase 


Bombyx mori 


NP_00 1182381 


528 


0.0 


86% 


Unigene_2015-ATF 


KC007360 


552 


67 


acetyltransferase 


Donous plexippus 


EHJ65977 


327 


6E-110 


90% 


Unigene_15362-ATF 


KC007361 


1010 


69 


putative acetyl-CoA acetyltransferase 


Donous plexippus 


EHJ68573 


444 


3E-150 


74% 



*Gene names as deposited in GenBank. § Genbank accession number of the homologous gene. ACQ Acetyl-CoA carboxylase, FAS Fatty acid synthase, DES Desaturase, FAR Fatty acyl reductase, AOX alcohol oxidase, AR 
Aldehyde reductase, ATF Acetyltransferase. 



Table 4 Candidate esterase genes likely involved in A. ipsilon pheromone degradation 
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Spsciss 


nLLCjJlvl 1 IIUIII. 


jluic ^uiioy 




% IHpntitv 
/u luciiuiy 


AipsCXEl 


Unigene 8856 


JX866730 


459 


4 


esterase 


Spodoptera littorolis 


ABH01081 


219 


9E-83 


75% 


AipsCXE2 


Unigene 1631 


JX866731 


1 148 


22 


antennal esterase CXE2 


Spodoptera littorolis 


ACV60229 


475 


5E-161 


67% 


AipsCXE3 


Unigene 1377 


JX866732 


969 


20 


antennal esterase CXE3 


Spodoptero littorolis 


ACV60230 


518 


2E-178 


82% 


AipsCXE4 


Unigene 7378 


JX866733 


504 


6 


antennal esterase CXE4 


SpodoptQFQ littorolis 


ACV60231 


252 


6E-78 


77% 




U I HLj cl \<c_^tZ I D 


JAOOO/ 




1 3 


dlllclllldl cblcldbc LAlj 


jpUUUpLciU IILLUlUllb 


A^-VOUZ JZ 




n n 
u.u 


9.7 OA 

0/ /o 


AipsCXE6 


Unigene_4686 


JX866735 


554 


7 


antennal esterase CXE6 


Spodoptera littorolis 


ACV60233 


229 


2E-69 


75% 


AipsCXE7 


Unigene_9837 


JX866736 


746 


5 


antennal esterase CXE7 


Spodoptera littorolis 


ACV60234 


338 


7E-110 


70% 


AipsCXE8 


Unigene_4231 


JX866737 


907 


11 


antennal esterase CXE8 


Spodoptera littorolis 


ACV60235 


381 


5E-126 


69% 


AipsCXE9 


Unigene_7661 


JX866738 


512 


6 


antennal esterase CXE9 


Spodoptera littorolis 


ACV60236 


226 


5E-68 


64% 


AipsCXEl 0 


Unigene_68 


JX866739 


1674 


54 


antennal esterase CXE10 


Spodoptera littorolis 


ACV60237 


387 


6E-123 


41% 


AipsCXEl 1 


Unigene_1628 


JX866740 


336 


9 


antennal esterase CXE1 1 


Spodoptera littorolis 


ACV60238 


160 


1E-74 


83% 


AipsCXEl 2 


Unigene_4126 


JX866741 


444 


16 


antennal esterase CXE12 


Spodoptera littorolis 


ACV60239 


192 


3E-53 


50% 


AipsCXEl 3 


Unigene_15183 


JX866742 


1843 


25 


antennal esterase CXE13 


Spodoptera littorolis 


ACV60240 


974 


0.0 


83% 


AipsCXEl 4 


Unigene_7537 


JX866743 


451 


5 


antennal esterase CXE14 


Spodoptera littorolis 


ACV60241 


256 


5E-80 


85% 


AipsCXEl 5 


Unigene_5546 


JX866744 


1380 


19 


antennal esterase CXE15 


Spodoptera littorolis 


ACV60242 


498 


5E-167 


52% 


AipsCXEl 6 


Unigene_7483 


JX866745 


474 


6 


antennal esterase CXE16 


Spodoptera littorolis 


ACV60243 


268 


3E-84 


77% 


AipsCXE20 


Unigene_5325 


JX866746 


622 


8 


antennal esterase CXE20 


Spodoptera littorolis 


ACV60247 


343 


2E-112 


81% 



*Gene names as deposited in GenBank. § Genbank accession number of the homologous gene. CXE carboxylesterase. 



Table 5 Candidate olfactory genes involved in A. ipsilon pheromone reception 



Gene Name* 


Unigene 


Accession num. 


Length (AA) 


Signal peptide 


No. of reads 


Putative identification 


Species 


Accession num. § 


Score (bits) 


E-value 


% Ident 


Chemosensory proteins 






















AipsCSPI 


Unigene_468 


JX863696 


124 


1-16 aa 


21 


chemosensory protein 


Heliothis virescens 


ACX53825 


130 


5E-36 


59% 


AipsCSP2 


Unigene_1704 


JX863697 


119 


1-16 aa 


117 


chemosensory protein 


Papili oxuthus 


BAF91716 


159 


8E-48 


66% 


AipsCSP3 


Unigene_1767 


JX863698 


128 


1-18 aa 


19 


chemosensory protein 


Momestro brossicoe 


AAF71290 


223 


2E-72 


83% 


AipsCSP4 


Unigene_2047 


JX863699 


120 


1-16 aa 


46 


chemosensory protein 2 


Helicoverpa armigera 


AEX07265 


222 


2E-72 


86% 


AipsCSP5 


Unigene_6052 


JX863700 


107 


1-18 aa 


16 


chemosensory protein 


Do no us plexippus 


EHJ67380 


186 


3E-55 


84% 


AipsCSP6 


Unigene_15000 


JX863701 


127 


1-18 aa 


13 


chemosensory protein 2 


Heliothis virescens 


AAM77040 


227 


3E-74 


87% 


AipsCSP7 


Unigene_15308 


JX863702 


128 


1-16 aa 


18 


chemosensory protein 


Heliothis virescens 


ACX53804 


211 


5E-68 


75% 


AipsCSP8 


Unigene_721 


JX863703 


123 


1-18 aa 


390 


chemosensory protein 


Heliothis virescens 


ACX53806 


206 


3E-66 


76% 


Odorant binding proteins 






















AipsOBPI 


Unigene_520 


JX863689 


183 


ND 


44 


odorant binding protein 


Heliothis virescens 


ACX53761 


197 


2E-60 


52% 


AipsOBP2 


Unigene_2120 


JX863690 


148 


1-21 aa 


44 


pheromone binding protein 4 


Momestro brossicoe 


AAL66739 


241 


6E-77 


79% 


AipsOBP3 


Unigene_6517 


JX863691 


108 


ND 


10 


odorant-binding protein 19 


Helicoverpa ormigero 


AFM93773 


127 


4E-34 


53% 


AipsOBP4 


Unigene_8860 


JX863692 


121 


ND 


4 


antennal binding protein 


Heliothis virescens 


CAC33574 


173 


7E-53 


65% 


AipsOBP5 


Unigene_15218 


JX863693 


137 


1-16 aa 


20 


odorant binding protein 


Heliothis virescens 


ACX53795 


193 


2E-60 


70% 


AipsOBP6 


Unigene_1571 1 


JX863694 


125 


ND 


5 


odorant binding protein 


Heliothis virescens 


ACX53743 


243 


7E-80 


86% 


AipsOBP7 


Unigene_15861 


JX863695 


145 


1-23 aa 


4 


odorant binding protein 3 


Helicoverpa ormigero 


AEB54582 


97.8 


9E-23 


38% 



*Gene names as deposited in GenBank. § Genbank accession number of the homologous gene. ND, not detected signal peptide because the N-terminus is missing. CSP chemosensory protein, OBP odorant 
binding protein. 
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Figure 3 The abundance of the unigenes encoding the sex pheromone synthase in the A. ipsilon transcriptome dataset presented as 
normalized read count in reads per kilobase per million mapped reads (RPKM). The putative enzyme names are indicated as gene abbreviations 
followed by Genbank accession numbers. ACC Acetyl-CoA carboxylase, AOX Alcohol oxidase, AR Aldehyde reductase, ATF Acetyltransferase, DES 
Desaturase, FAR Fatty acyl reductase, FAS Fatty acid synthase. 



[38]. At least three active pheromone components (Z7-12: 
OAc, Z9-14:OAc and Zll-16:OAc) have been identified in 
A. ipsilon strains from China [38], North America [45], 
France [46] and Japan [47]. It is reasonable to propose that 
the saturated fatty acid precursor of A. ipsilon sex phero- 
mones would be palmitic acid (16:0) which is desaturated 
by All-desaturase to form the precursor Zll-16:acyl-CoA 
for the production of two major (Z7- 12:OAc and Z9-14: 
OAc) and two minor (Zll-16:OAc and Z5-10:Ac) phero- 
mone components (Figure 7). It is not clear how the minor 
pheromone component (Z8-12:OAc) is synthesized in A. 
ipsilon, which should involve a A12-desaturase. Other 
studies in Lepidoptera species support a All -desaturase 



acting on palmitic acid and leading to the production of the 
sex pheromone components [19,20,48]. In the A, ipsilon 
pheromone gland transctiptome 5 transcripts have high 
homology to genes encoding desaturases (Table 3). DES- 
JX9891S2 is homologous to a gene encoding an acyl-CoA 
A9-desaturase in M. brassicae (Protein ID: ABX90048) with 
an amino acid identity of 96%. A9-desaturase makes oleic 
acid from stearic acid (18:0) and possibly palmitoleic acid 
from palmitic acid [16,17,49]. It would not participate in 
the biosynthesis of A. ipsilon sex pheromones. DES- 
JX989153 encodes a protein with 87% amino acid identity 
with the acyl-CoA All desaturase of M. brassicae (Protein 
ID: ABX90049). DES-JX989154, DES-JX9891SS and DES- 
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Figure 4 The abundance of unigenes encoding chemosensory proteins (CSPs), odorant-binding proteins (OBPs) and esterase (EST) in 
the A. ipsilon transcriptome dataset presented as normalized reads in reads per kilobase per million mapped reads (RPKM). 



JX989156 encode proteins, respectively, with 94% amino 
acid identity to the acyl-CoA desaturase from H. assulta 
(Protein ID: AF482909), 64% amino acid identity to a 5. 
littoralis desaturase (Protein ID: AAQ74260) and 93% 
amino acid identity to an acyl-CoA desaturase of S. exigua 
(Protein ID: AAM28510). These transcripts could possibly 
encode A12-desaturases in A. ipsilon in formation of the 
minor pheromone component Z8-12:OAc from the precur- 
sor Z12-16:acyl-CoA. However, they could also function as 
A9-desaturase. Further study on their enzyme activity 
could confirm their role in the sex pheromone biosyn- 
thesis. The RT-PCR and qRT-PCR results indicated that 
DES-JX9891S3 and DES-JX989154 are highly expressed 
in the A. ipsilon pheromone gland compared with the 
body (85 and 63 fold higher, respectively) (Figure 5 and 
Figure 6). One of the transcripts (DES-JX989154) is also 



highly abundant (1206 RPKM) in the pheromone gland 
transcriptome (Figure 3), suggesting a possible role in 
A. ipsilon sex pheromone biosynthesis. 

Fatty acyl-CoA reductase (FAR) 

Once a specific All and possibly A12 double bond is 
introduced into fatty acid precursors to form a fatty 
acyl-CoA precursor, the chain of the precursors is then 
shortened sequentially by E-oxidation to form different 
shorter chain fatty acyl-CoA precursors [6]. These pre- 
cursors are further reduced individually by fatty acyl 
reductase (FAR) to form corresponding fatty alcohols 
[26,28,51]. In the A. ipsilon pheromone gland transcrip- 
tome there are 13 transcripts homologous to putative 
FAR genes (Table 3). Among them, 5 transcripts encode 
proteins with 59%-80% amino acid identity to the fatty- 
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Figure 5 RT-PCR results showing the relative expression of the A. ipsilon pheromone biosynthesis-related genes in pheromone gland 
(PG) and the body (BO). The genes that are more highly expressed in the pheromone gland are labeled with red pentagram, fi-actin was used 
as internal reference gene to test the integrity of each cDNA templates; the similar intensity of p-octin bands between the pheromone gland and 
the body part indicate the use of equal template concentrations. 



acyl CoA reductases of Ostrinia nubilalis (Protein IDs: 
ADI82776, ADI82777, ADI82778 and ADI82779). Other 
FAR transcripts are homologous to the fatty acyl-CoA 
reductase from a wide range of insect species including 
H. virescens, N. vitripennis, Danaus plexippus, Bombus 
terrestris and Apis mellifera with amino acid identities of 
about 60% (Table 3). The RT-PCR and qRT-PCR results 
indicated that three transcripts (FAR-JX989157, FAR- 
JX989162 and FAR-JX989164) are highly expressed in 
the pheromone gland (Figure 5 and Figure 6). The other 
ten transcripts seem equally expressed in the pheromone 
gland and the body or highly expressed in the body. All 
FAR transcripts except two (FAR-JX989157 and FAR- 
JX9891S9) have low abundance (from 81 and 16 RPKM) 
in the pheromone gland transcriptome (Figure 3). 

Alcohol oxidase/dehydrogenase (AOX) 

Fatty alcohols can be used as pheromone components 
in many moth species, and they are also pheromone 
intermediates to produce aldehyde pheromones by the 
alcohol oxidases [52,53]. In the A. ipsilon PG 5 homolo- 
gous genes of alcohol oxidase/dehydrogenase were 
identified, the BLASTx results revealed three unigenes 
(AOX-KC007341, AOX-KC007342 and AOX-KC007344) 
are with the amino acid identity of 43%, 55% and 64%, 
respectively, to a putative alcohol dehydrogenase of 
D. plexippus (Protein ID: EHJ70611), and one unigene 
(AOX-KC007345) are homologous to another putative al- 
cohol dehydrogenase of D. plexippus (Protein ID: 
EHJ73729 ) with the amino acid identity of 68%. AOX- 
KC007343 showed 78% amino acid identity with the 
alcohol dehydrogenase of H. virescens (Protein ID: 



ACX53694). The RT-PCR and qRT-PCR results indicated 
that AOX-KC007341 and AOX-KC007343 showed a higher 
expressed level in the PG than in the body (Figure 5 and 
Figure 6). 

Aldehyde reductase (AR) 

Aldehyde reductases are members of the aldo- 
ketoreductase superfamily and could be used to reduce 
long-chain acyl-CoA to form alcohol intermediates [13]. In 
the A. ipsilon pheromone gland we identified 11 transcripts 
with homology to the aldo-ketoreductases of Papilio 
dardanus, B. mori, H. armigera, D. plexippus, Culex 
quinquefasciatus, H. virescens and Papilio xuthus (Table 3). 
The derived protein sequences of these 11 transcripts show 
53%-88% amino acid identity with their homologs in other 
insects. The RT-PCR and qRT-PCR results indicated that 
AR-KC007350 and AR-KC007351 are mainly expressed in 
the pheromone gland, while the other 9 putative aldehyde 
reductase transcripts have equal expression levels between 
the pheromone gland and the body or a higher expression 
level in the body (Figure 5 and Figure 6). All aldehyde 
reductase transcripts are present at low abundance (from 
67 to 10 RPKM) in the pheromone gland transcriptome 
(Figure 3). The involvement of aldehyde reductase in sex 
pheromone biosynthesis has not been demonstrated in 
moth species. 

Acetyltransferase (ATF) 

The fatty acid alcohols are used as pheromone components 
in many moth species. In A. ipsilon whose sex pheromone 
blends comprise only acetates, they are intermediates and 
acetylated to pheromone components as acetate esters by 



Gu et al. BMC Genomics 2013, 14:636 
http://www.biomedcentral.com/1471 -21 64/1 4/636 



Page 13 of 21 




Body PG £ Body PG £ Body PG 

ACC-JX989 1 49 ACC-JX989 1 50 FAS-JX989 1 5 1 




Body PG 
DES-JX989153 




Body PG 
FAR-JX989162 




b_ 

Body PG 
DES-JX989154 



2 150 




2 0 



Body PG 
FAR-JX989164 




S 0 
o 




Body PG 
FAR-JX989157 



a 

Body PG 



PG 

AOX-KC007341 




AOX-KC007343 AR-KC007350 AR-KC007351 





2 15- 

|» 

i' 

a o 1 

Body PG £ Body PG £ Body 

ATF-KC007357 ATF-KC007358 ATF-KC007360 




Figure 6 (See legend on next page.) 
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(See figure on previous page.) 

Figure 6 qRT-PCR results showing the relative expression levels of the A. ipsilon pheromone biosynthesis related genes between the 
pheromone gland (PG) and the body (BO). The putative enzyme names are indicated as gene abbreviations followed by Genbank accession 
numbers. ACC Acetyl-CoA carboxylase, FAS Fatty acid synthase, DES Desaturase, FAR Fatty acyl reductase, AOX alcohol oxidase, AR Aldehyde 
reductase, ATF Acetyltransferase. The internal control (3-actin and ribosomal protein S3 were used to normalize transcript levels in each sample. 
This figure was presented using (3-actin as reference gene to normalize the target gene expression and correct sample-to-sample variation; similar 
results were also obtained with ribosomal protein S3 as reference gene. The standard error is represented by the error bar, and the different letters 
(a, b) above each bar denote significant differences (p<0.05). 

V ) 



actyltransferases [13]. In the A. ipsilon pheromone gland 
transcriptome 5 acetyltransferase homologous transcripts 
were identified (Table 3), 3 of them (ATF-KC007357, ATF- 
KC007360 and ATF-KC007361) encode proteins that are 
homologous to the acetyltransferase of D. plexippus (Pro- 
tein IDs: EHJ65205, EHJ65977 and EHJ68573) with rela- 
tively high amino acid identities (>70%), one (ATF- 
KC007358) encodes a protein with 90% amino acid identity 
to H. virescens acetyltransferase (Protein ID: ACX53812) 
and one (ATF-KC0073S9) encodes a protein with 86% 
amino acid identity with the acetyltransferase of B. mori 
(Protein ID: NP_001 182381). The RT-PCR and qRT-PCR 
revealed that three transcripts (ATF-KC007358, ATF- 
KC007360 and ATF-KC007357) are mainly expressed in the 
pheromone gland (Figure 5 and Figure 6) and have a rela- 
tive high abundance of 195, 155 and 71 RPKM, respectively 
in the pheromone gland transcriptome (Figure 3). 

Genes encoding candidate pheromone degrading 
enzymes in the A. ipsilon pheromone gland 

It would be potentially harmful to insects if pheromone 
molecules and other odorants remained on the olfactory re- 
ceptors after they had stimulated the olfactory receptor 
neurons (ORNs). It is therefore thought that there are 
mechanisms to protect the ORNs by odorant degrading en- 
zymes (ODEs) [37] including esterases [54,55], aldehyde ox- 
idases [56-58], cytochromes P450 [59-61], carboxyl esterase 
[62], and glutathione S-transferase (GST) [63]. In this study, 
we identified 17 transcripts predicted to encode esterases in 
the A. ipsilon pheromone gland, and the BLASTx results 



showed that all have very high amino acid identities 
with the antennal esterases of S. littoralis (Table 4), we 
named them as AipsCXEl-AipsCXE16 and AipsCXE20 
following the nomenclature in S. littoralis. Our qRT-PCR 
results revealed that 7 of the transcripts (AipsCXE3, 
AipsCXE7, AipsCXE8, AipsCXE9, AipsCXEll, AipsCXEM 
and AipsCXE20) are antennal-enriched, 3 (AipsCXE5, 
AipsCXElO and AipsCXElS) are both antennal- and phero- 
mone gland-enriched and the remaining 7 (AipsCXEl, 
AipsCXE2, AipsCXE4, AipsCXE6, AipsCXE12, AipsCXE13 
and AipsCXE16) have similar expression levels in antennae, 
body and pheromone gland, suggesting they are not phero- 
mone specific (Figure 8). 

Genes encoding candidate pheromone carrier proteins in 
the A. ipsilon pheromone gland 

Moth sex pheromones are synthesised and protected from 
degradation until being released from the female phero- 
mone gland and it has been proposed that OBPs and CSPs 
could participate in this process. In this study we have 
identified transcripts of 7 OBPs and 8 CSPs from the 
A. ipsilon pheromone gland (Table 5), all of these have the 
typical insect OBP sequence motif Cl-X 15 _39-C2-X 3 -C3- 
X2i_44-C4-X 7 _ 12 -C5-X 8 -C6 [35,64] or CSP sequence motif 
Ci-X 6 _ 8 -C2-X 16 _2i-C3-X 2 -C 4 [65]. One CSP transcript, 
AipsCSP2 seems to be gland-specific and has an extremely 
high expression level (>100 folds) in the pheromone 
glands compared with the antennae and body and a rela- 
tive high abundance in the pheromone gland transcrip- 
tome. AipsCSP8 shows a higher expression level in the 
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pheromone gland (10-fold higher than in body) (Figure 9) 
and is extremely abundant with 1,364 RPKM in the phero- 
mone gland transcriptome (Figure 4). 

There is one OBP transcript (AipsOBP6) which is highly 
expressed in the pheromone gland (more than 3-fold 
higher than in the antennae), and 3 OBPs {AipsOBPl, 
AipsOBP2 and AipsOBP4) are highly expressed in the an- 
tennae (Figure 10). This high expression of OBPs and 
CSPs in the pheromone gland is interesting because it sug- 
gests a possible involvement in carrying and releasing sex 
pheromones as demonstrated for the antennal OBPs and 
CSPs. However, the molecular mechanisms that connect 



these proteins with the involvement of pheromone pro- 
duction needs further investigation. No ORs, IRs and 
SNMPs are identified in the A. ipsilon pheromone gland. 

Conclusions 

The black cutworm A. ipsilon is a destructive pest of many 
crops [66,67] and mainly controlled by chemical pesticides, 
which has led to the development of resistance to various 
compounds [68]. Our study provides information and 
resource to identify and facilitate functional studies of genes 
responsible for pheromone production, transport and 
degradation at the molecular level both in vivo and in vitro. 
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Figure 9 qRT-PCR results showing the relative expression of the A. ipsilon unigenes encoding putative chemosensory proteins (CSP) 
identified in the pheromone gland in the male antennae (MA), the female antennae (FA), the body (BO) and the pheromone gland 
(PG). The standard error is represented by the error bar, and the different letters (a, b) above each bar denote significant differences (p <0.05). 



By deep sequencing of the A. ipsilon sex pheromone 
gland transcriptome, we have identified 42 transcripts 
encoding enzymes putative involved in pheromone pro- 
duction. This is the first study reporting the key enzyme 
All -desaturase involved in A. ipsilon sex pheromone 
biosynthesis. One new transcript (DES-JX989154) encod- 
ing a desaturase is highly abundant in the transcriptome 
and highly expressed in the pheromone gland, sugg- 
esting this desaturase encoded by DES-JX989154 or 
other newly identified transcripts (DES-JX989155 and 



DES-JX989156) may play important roles in A. ipsilon 
sex pheromone biosynthesis. They may contribute in the 
introducing a double bond at Cll and C12 positions of 
the saturated fatty acid precursor palmitic acid for the 
production of pheromone precursors. Further studies 
are needed to confirm the substrates and the products 
thus the involvement of these desaturases and other 
newly identified genes such as those encoding for alde- 
hyde reductases and acetyltransferases in A. ipsilon sex 
pheromone biosynthesis. Two of the CSPs are highly 
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Figure 10 qRT-PCR results showing the relative expression of the A. ipsilon unigenes encoding putative odorant binding proteins 
(OBP) identified in the pheromone gland in the male antennae (MA) 7 the female antennae (FA) 7 the body (BO) and the pheromone 
gland (PG). The standard error is represented by the error bar, and the different letters (a, b, c) above each bar denote significant 
differences (p < 0.05). 
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abundant transcripts (AipsCSP2 and AipsCSP8) with 100- 
and 10-fold higher transcription level, respectively than in 
the body. Furthermore AipsCSP2 and AipsOBP6 are 
pheromone gland-specific and -enriched, respectively 
(Figure 9 and Figure 10). This suggests a functional role of 
the PG-enriched CSPs and OBPs in sex pheromone trans- 
port and release. It is clear that during perireceptor event 
after pheromones and odorants enter the sensillun lymph 
that the antennae-specific odorant binding proteins 
(OBPs) capture these hydrophobic pheromone and odor- 
ant and deliver them to the membrane-bound olfactory 
receptors (ORs) [35]. Further study of these PG-expressed 
OBPs, especially their binding to sex pheromone compo- 
nents is needed to confirm its function. 

Methods 

Insect material 

The A. ipsilon colony has been reared in our laboratory 
(State Key Laboratory for Biology of Plant Diseases and 
Insect Pests, Chinese Academy of Agricultural Sciences, 
Beijing, China) since 2006 with field-collected moths intro- 
duced each summer to prevent inbreeding effects. The 
larvae were reared on an artificial diet comprising wheat 
germ, casein and sucrose as the main components. The 
colony was kept at 24°C with 75% relative humidity and a 
14h:10h light:dark photoperiod. Pupae were sexed and kept 
separately in hyaline plastic cups before emergence. Adult 
moths were given 20% honey solution after emergence. 

Pheromone gland dissection 

The pheromone gland plus associated ovipositor valves 
and parts of the terminal abdominal segments were dis- 
sected with fine scissors [39] from the rest of the body 
parts refereed as 'body which comprises of heads, thor- 
axes, legs, wings and abdomens (without the pheromone 
glands). The calling behavior of female A. ipsilon moths 
begins on the first night after eclosion and increases 
sharply, peaking on the third night [38]. So in order to 
cover all genes involved in pheromone biosynthesis, four 
glands of 1 -day-old females, four glands of 2-day-old fe- 
males and ten glands of 3-day-old females were dis- 
sected during the second half of the scotophase, which is 
reported to be the calling period of this moth [69-71]. 
The eighteen glands were mixed in one RNase-free cen- 
trifuge tube for total RNA extraction and frozen in li- 
quid nitrogen until further processing. 

RNA extraction and cDNA library construction 

Total RNA was extracted using TRIzol regent (Invitrogen, 
Carlsbad, CA, USA) according to the manufactu- 
rers protocol. The quantity of RNA was determined 
using a Nanodrop spectrophotometer (Thermo Scientific, 
Wilmington, DE, USA) and 1.1% agarose gel electrophoresis. 
About 500 ng mRNA was further purified from 50 \xg total 



RNA using the polyATtract mRNA isolation system III 
(Promega, Madison, WI, USA). The mRNA was then 
sheared into about 800 nucleotides using a RNA fragmenta- 
tion solution (Autolab, Beijing, China) at 70°C for 30 sec, 
and then cleaned and condensed using RNeasyMinElute 
RNA Cleaning Up kit (Qiagen, Valencia, CA, USA). The 
mRNA was used as a template for first-strand cDNA synthe- 
sis using N6 random primers and MMLV reverse tran- 
scriptase (TaKaRa, Dalian, China) and the second strands 
were synthesized using Secondary Strand cDNA synthesis 
enzyme mixtures (Autolab, Beijing, China). cDNAs with 
appropriate length were purified with the QIAquick PCR 
Purification kit (Qiagen, Valencia, CA, USA) and eluted 
with 10 \A Elution Buffer. After blunt ending and the 
addition of a poly-A tail at the 3' end according to the 
Roches Rapid Library Preparing protocols (Roche, USA), 
the purified cDNAs were linked to GS-FLX sequencing 
Adaptors (Roche, USA). Finally, the cDNAs shorter than 
500 bp were removed using Ampure Beads according to 
the manufactures' instruction (Beckman, USA) before the 
preparation of the cDNA library. 

454 sequencing 

Pyrosequencing of the cDNA library was performed by 
Beijing Autolab Biotechnology Company using a 454 
GS-FLX sequencer (Roche, IN, USA). All sequencing 
reads were deposited into the Short Read Archive (SRA) 
of the National Center for Biotechnology Information 
(NCBI) under the accession number SRX189143. 

Sequence analysis and assembly 

Base calling of the raw 454 reads in SFF files was carried out 
using the python script sff_extract.py developed by COMAV 
(http://bioinf.comav.upv.es). All raw reads were then 
processed to remove low quality and adaptor sequences 
using programs tagdust [72], LUCY [73] and SeqClean 
[74] with default parameters. The resulting sequences 
were then screened against the NCBI UniVec database 
(http://www.ncbi.nlm.nih.gov/VecScreen/UniVec.html) to 
remove possible vector sequence contamination. Cleaned 
reads shorter than 60 bases were discarded because they 
are likely to be sequencing artifacts [75]. 

Two steps were taken to assemble the clean reads. First 
MIRA3 [76] was used with the assembly settings of mini- 
mum sequence overlap of 30 bp and minimum percentage 
overlap identity of 80%. Then CAP3 was used with assem- 
bly parameters of overlap length cutoff >30 and overlap 
percent identity cutoff >90% [77]. The resulting contigs 
and singletons of more than 100 bases were retained as 
unigenes and annotated as described below. 

Homology searches and functional classification 

Following the assembly, homology searches of all unigenes 
were performed using BLASTx and BLASTn programs 
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against the GenBank non-redundant protein (nr) and nu- 
cleotide sequence (nt) database at NCBI [78]. Matches with 
an E-value less than 1.0E-5 were considered significant 
[79]. Gene names were assigned to each unigene based on 
the best BLASTx hit with the highest score value. 

Gene Ontology terms were assigned by Blast2GO [80] 
through BLASTx program with an E-value less than 1.0E-5. 
Then, WEGO [81] software was used for assignment of each 
GO ID to the related ontology entries. The longest open 
reading frame (ORF) of each unigene was determined by an 
ORF finder tool (http://www.ncbi.nlm.nih.gov/gorf/gorf.html). 

Pheromone gland ESTs from other insects 

The H. virescens pheromone gland ESTs (14112 with 
accession number: GR958232-GR972305, GT067784- 
GT067747) [39], the A. segetum pheromone gland ESTs 
(2286 with accession number: ES582156-ES584441) [82] 
and the B. mori pheromone gland ESTs (10501 with 
accession number: BP184340-BP182009; AV404455- 
AV403746; DC552314-DC544856) were downloaded 
from the dbEST database at NCBI (http://www.ncbi.nlm. 
nih.gov/nucest) and saved as fasta files. All the EST se- 
quences were assembled using the CAP3 program with 
the same parameters as used in the A. ipsilon assembly. 
The comparative analyses of A. ipsilon, H. virescens, B. 
mori and A. segetum pheromone gland unigenes were 
performed based on the Best Bidirectional Hits results 
(reciprocal BLASTn, E-value less than 1.0E-6). 

Identification of candidate genes associated with moth 
pheromone biosynthesis 

Some putative genes and enzymes have been reported 
previously as being involved in moth sex pheromone 
production. We focused our research on the target genes: 
(1) Acetyl-CoA carboxylase; (2) Fatty acid synthase; (3) 
Desaturase; (4) Fatty acyl reductase; (5) Alcohol oxidase; 
(6) Aldehyde reductase; (7) Acetyltransferase. 

Identification of putative genes involved in pheromone 
degradation 

Since the sex pheromone blend of A. Ipsilon is comprised 
of acetate esters (Z)-7-dodecenyl acetate (Z7-12:Ac) 
(40.5%), (Z)-9-tetradecenyl acetate (Z9-14:Ac) (13.2%), 
(Z)-ll-hexadecenyl acetate (Zll-16:Ac) (14.9%), (Z)-8- 
dodecenyl acetate (Z8-12:Ac) (17.2%) and (Z)-5-decenyl 
acetate (Z5-10:Ac) (14.3%) [38], esterases may play a major 
role in pheromone degradation. Therefore, we performed 
BLASTx and BLASTn searches to identify candidate ester- 
ase genes in the A, ipsilon pheromone gland NGS dataset. 

Identification of putative genes involved in pheromone 
transport 

Genes encoding odorant binding proteins (OBPs) and 
chemosensory proteins (CSPs) were identified using the 



"OBP sequence motif Cl-X 15 _3 9 -C2-X3-C3-X 21 _4 4 -C4- 
X 7 _ 12 -C5-X 8 -C6 [64] and the "CSP sequence motif 
C 1 -X 6 _ 8 -C 2 -X 16 _2i-C3-X 2 -C4, [65]. Candidate olfactory 
receptors (ORs), ionotropic receptors (IRs), sensory 
neuron membrane proteins (SNMPs) genes were identi- 
fied by BLASTx and BLASTn searches. 

Sequence analyses 

The putative N-terminal signal peptides and most likely 
cleavage sites were predicted by the SignalP V3.0 
program [83] (http://www.cbs.dtu.dk/services/SignalP/). 
Sequence alignments were done with ClustalX 1.83 [84] 
with default gap penalty parameters of gap opening 10 
and extension 0.2. 

RT-PCR and qRT-PCR 

The cDNAs from female pheromone glands and other 
body parts (mixture of heads, thoraxes, legs, wings and 
abdomens (without the pheromone glands)) were synthe- 
sized using PrimeScript RT Reagent with gDNA Eraser 
(TaKaRa, Dalian, China). 200 ng cDNA was used as 
RT-PCR and qRT-PCR templates. Specific primer pairs for 
RT-PCR analysis were designed with Primer 3 (http://frodo. 
wi.mit.edu/) or Primer Premier 5 (see Additional file 4). 
To test the integrity of the cDNA templates, a pair of con- 
trol primers for the |3-actin (GenBank Acc. JQ822245) of 
A. ipsilon was used. The PCR cycling profile was: 95°C for 
2 min, followed by 35 cycles of 95°C for 30 sec, 60°C for 
30 sec, 72°C for 1 min and a final extension for 10 min at 
72°C. PCR products were separated in 1.2% agarose gels 
and stained with ethidium bromide. Each reaction was 
done at least six times with three biological replicates. 

qRT-PCR analysis was conducted using the ABI 7500 
Real-Time PCR System (Applied Biosystems, Carlsbad, 
CA). The primers were designed by Beacon Designer 7.90 
(PREMIER Biosoft International) (see Additional file 5). 
Two reference genes, fi-actin (GenBank Acc. JQ822245) 
and ribosomal protein S3 (GenBank Acc. JQ822246) were 
used for normalizing expression of the target gene and 
correcting for sample-to-sample variation. qRT-PCRs were 
done in a 25 \A reaction containing 12.5 \A of Platinum 
SYBR Green qPCR SuperMix-UDG (Invitrogen, Shanghai, 
China), 0.5 \A of each primer (10 pmol/ |il), 0.5 ul of Rox 
Reference Dye, 1 [A of sample cDNA (200 ng/|il), 10 [A of 
sterilized H 2 0. The cycling parameters were: 50°C for 
2 min, 95°C for 2 min, followed by 40 cycles of 95°C for 15 
sec and 60°C for 30 sec. Then, the PCR products were 
heated to 95°C for 15 sec, cooled to 60°C for 1 min and 
heated to 95°C for 30 sec and cooled to 60°C for 15 sec to 
measure the dissociation curves. Negative controls, with- 
out either template or transcriptase, were included in each 
experiment. To check reproducibility, each qRT-PCR reac- 
tion for each sample was carried out in three technical 
replicates and three biological replicates. 
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qRT-PCR data analysis 

Relative quantification was performed using the com- 
parative 2" AACt method [85]. All data were normalized to 
endogenous fi-actin or ribosomal protein S3 levels from 
the same individual samples. In the analysis of the rela- 
tive fold change in different tissues, the body sample was 
taken as the calibrator. Thus, the relative fold change in 
different tissues was assessed by comparing the expres- 
sion level of each target gene in other tissues to that in 
the body part. The results are presented as the mean of 
the fold change in three biological samples. The com- 
parative analyses of each OBP, CSP and CXE gene 
among different tissues were determined with one-way 
nested analysis of variance (ANOVA), followed by a 
Tukeys honestly significance difference (HSD) test using 
SPSS Statistics 18.0 (SPSS Inc., Chicago, IL, USA). The 
comparative analyses of each putative pheromone bio- 
synthesis gene between pheromone gland (PG) and body 
part were determined with paired £-test. When applic- 
able, values were presented as mean ± SE. 
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